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In  their  beck.  Box  and  Jenkins  (1970)  strongly  recommend  that  forecasts 
of  ARL'.IA  processes  be  cade  using  the  Difference  Equation  form  because  it  is 
the  simplest  approach.  All  subsequent  textbooks  have  endorsed  this  view  to 
the  extent  that  very  few  of  them  consider  forecasting  in  the  Integrated  Form 
in  any  detail.  A  notable  exception  to  this  is  the  recent  book  by  Abraham  and 
ledolter  (1982)  which  contains  some  useful  detailed  discussion  of  the  role  of 
the  Eventual  Forecast  Function  (EFF)  in  generating  forecasts.  The  purpose  of 
this  note  is  to  indicate  that  the  Integrated  Form  does  have  value  and  is  wort: 
of  consideration.  <Ve  begin  with  a  brief  review  cf  the  derivation  of  the 
necessary  components  for  this  form  of  the  forecast. 

THE  I.'iTEJRATED  FORM  OF  THE  FORECAST 
Suppose  {X.}  is  an  ARIMA  process  satisfying  aiBIX,  =  &(5;ak 

> 

where  B  is  the  Backshift  Operator  which  is  defined  by  3"X+  =  X.  and 


?  >  q 

where  a(3)=  a,  S'*  ,  8(3)  =  l 
k=0  k=0 

difference  terms  are  included  in  the 

T  >  q  ,  7  a  X*(T-k)  =  0  ,  where 


k  =0 

X  .  and  takes  the  value  Xt  .  if 
t+i  t*i 


8^3^  ,  ciq  s  8q  =  1  and  ail  seasonal  and 
a-  and  3-  operators.  For  lead  times 
Xt(i)  is  the  forecast  mace  at  time  t  o: 
i  <  C  .  The  solution  of  this  different 


equation  is  the  EFF.  It  can  be  written  as 

2  t 

Xf(T)  =  L  b“f  (T) 
k-i  '*  ' 

where  f,^(T)  are  deterministic  functions  of  T  and  may  include  pclyr.c 
exponentials,  sinusoids  and  products  of  these.  They  may  also  be  dummy 
variables  generating  a  seasonal  pattern. 


(i: 


As  a  representation  cf  the 


T-steo  ahead  forecast  (1)  above 


valid  for 


where  {e  }  is  the  sequence  of  one-step  ahead  forecast  errors,  i.e. 
et  =  Xt  “  Xt-l(1)  * 

Box  and  Jehkins  (1970)  show  that  the  current  value  of  =  (b^,b^, . . ,b^) ' 
may  be  obtained  from  b^  ^  via  a  linear  equation  _b+  =  Lb+  ^  +  he  f  .  The 
matrix  L  effects  the  changes  in  the  coefficients  in  revising  the  tine  origin 
from  ( t — 1 )  to  t  ,  and  can  be  obtained  as  L  =  F ,,dF,.  ,  where  F,,  is  the  (o*o) 

matrix  with  (i,j)th  element  f. (M+i)  .  The  vector  h  =  ,  where 

-1  =  ^M*-l  '^'1+2 '  ’^M+p  ^  '  and  C'k  i3  the  coefficient  cf  3*  ir- 
o(3)  =  |3(3)/a(3)  ,  the  usual  moving -average  representation  of  the  AF.IMA. 

process. 

V/e  may  note  that  the  revision  matrix  L  is  block  diagonal.  Each  block 
corresponds  to  a  real  (repeated)  linear  factor  or  conjugate  pairs  of  complex 
factors  in  a(3)  .  The  effect  of  this  structure  is  that  individual  components 
can  be  monitored,  revised  and  projected  independently  of  the  others.  Thus,  for 
example,  any  linear  trend  and  seasonal  factors  can  be  obtained  at  each  time  t 
given  only  their  values  at  time  (t-1)  and  the  latest  forecast  error  eA  . 
Details  of  the  derivation  of  the  revision  equations  for  the  linear  trend  and 
seasonally  differenced  ARIMA  model  forecasts  are  given  in  McKenzie  (1984b). 

They  can  be  obtained  without  iirect  evaluation  and  inversion  of  the  matrix  F,,. 

If  we  wish  to  generate  not  merely'  individual  components  cf  the  EFF  but  the 
forecast  itself  using  the  Integrated  form,  we  crust  use  (1)  or  (2)  above.  If 

M  >  C  ,  it  is  necessary  to  obtain  {ci,  ,  :  i  =  G,1,...,M-T  ;  T  =  1,2 . ’.!}  . 

Box  and  J 


eser.t  here  a  general  solutio 


ammeters 


3 


The  derivation  of  this  result  is  given  in  an  Appendix  at  the  end  of  the  paper. 

There  it  is  shown  that  cL  .  =  d_  .  .  .  ,  (j  =  1,2,...,M-T  ;  T  =  1,2,...,}.!). 

i  >  J  » J  — ■*■ 

Thus,  it  is  necessary  to  obtain  only  {d^  ^  :  T  =  1,2, . . . ,M} ,  and  these  are 
easily  derived  from  the  following  matrix  equation.  ’ 


a  a  ,  a  »  . . . 
P  P-1  P-2 


1,0 


‘  apJ  LdM'qJ 


The  triangular  nature  of  these  equations  evidently  makes  them  particularly  easy 
to  solve.  This  is  especially  true  when  we  note  that  M  =  q  -  p  is  very  rarely 


large. 


3ox  and  Jenkins  give  some  r.on-seascnal  examples  in  their  book  and  a  sea¬ 


sonal  one  is  given  in  McKenzie  (19S4a)  expressing  the  well-known  airline  model 
forecasts  in  current  level,  gradient  and  seasonal  factor  form.  7/e  give  one 
other  brief  example  here  to  illustrate  the  evaluation  of  the  d-coef f icients. 


Consider 


(1  -  >t 5 )  ( 1  -  B)X+  =  (1  -  0En)at  . 


New,  M=q-p=4-2=2  .  Thus, 


»  .  r °i 


L°  4  JlitoJ  b 


i.e.  i. 


2 

£•{!  +  : . 
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(ii)  Forecast  Equation 


X*(k)  =  b?  +  kb? 

w  O  1 


k  >  1 


There  is  a  fixed  computational  investment  in  generating  b+=  (b  ,b ,  ) 
before  any  forecast  can  be  obtained.  Cnee  this  is  achieved,  however, 
forecasts  for  any  lead  times  are  easily  derived  using  equation  (5).  All  the 
storage  requirements  relate  to  the  revision  equations  (4),  and  here  involve 
b  ,  h.  ,  h,  and  e  . 

X  "■  ±  X 

For  the  Difference  Equation  form,  the  forecasts  are  generated  directly: 

XtQ)  =  Xt_x(2)  *  Vt 

X*(2)  =  X.  ,  (3)  +  (6) 

o  X  ”  x  i j 


Xt(k)  =  2Xt(k-l)  -  Xt(k-2) 


s.  >  3 


Mote  that  there  are  no  revision  equations  which  do  not  generate  forecasts. 
However,  it  is  clear  that  the  forecasts  of  lead  times  k=l,2  and  3  play  a  role 
in  equations  (6)  and  (7)  similar  to  that  of  b^  in  equations  (4)  and  (5). 
Another  important  point  here  is  that  to  obtain  forecasts  for  any  lead  time 
using  (6)  and  (7)  we  must  first  generate  the  forecasts  for  ail  shorter  lead 
times . 

To  illustrate  the  differences  between  the  two  approaches  we  consider  some 
simple  forecasting  scenarios.  Suppose  we  wish  to  forecast  for  lead  times  k  = 
1,2,. ..,7.  The  computational  requirements  are  clearly  comparable.  The 
Integrated  Form  requires  less  storage  to  generate  the  forecasts,  but  the 
revision  equations  necessitate  a  little  mere  arithmetic  than  the  use  of  (6) 
and  (7).  Suppose  now  that  the  lead  times  of  interest  are  not  consecutive. 

For  example,  suppose  we  wish  forecasts  for  lead  times  k=I,2,3,c,12  and  13. 
Using  the  Integrate i  Form,  we  need  to  generate  only  these  six  forecasts,  once 


b_  has  been  obtained.  However,  the  Difference  equation  fora  requires  the 
generation  of  all  18  forecasts.  Alternately,  we  nay  routinely  forecast  for 
lead  tines  1,2  and  3  and  occasionally  require  forecasts  for  others,  e.g.  6,1 
18  etc.  Again,  it  is  easier  to  generate  these  via  (4), (5)  than  (6),;?). 

Another  prediction  of  common  interest  is  the  cumulative  forecast,  i.e. 

rn  T* 

the  forecast  of  Y  (T)  =  ^  X...  .  It  is  given  by  Y*(7)  =  7  X, (k).  Using  t 

t  k;1  -  k=1  +' 

Difference  Equation  form  we  must  generate  X^(k)  for  k=l,2,...,T  and  then  sun 

them.  For  the  Integrated  Form,  we  can  use  a  single  forecast:- 

m 

YJT)  =  7  ;b“  -  kb’;)  =  Tb*  +  ST;T*i)b? 

.  o  -  o  l 

These  comments  apply  equally  well  to  other  situations  in  which  linear 
functions  cf  future  values  are  to  be  predicted.  They  are  usually  more  easil 
treated  using  the  Integrated  Form. 

There  is  no  suggestion  here  that  one  approach  is  always  better  than  the 
ether.  Rather  that  each  has  certain  advantages  in  certain  situations.  The 


investment  m  tr.e  .-.evasion  louatT 


r n "t. tr ^ r'-'i  u 5 d  Form 


flexibility  in  forecast  generation  and  a  variety  of  forecasting  problems  are 
thus  more  efficiently  treated  by  this  approach.  To  assess  the  cost  of  such 
investment,  note  that  the  dimension  cf  the  vector  is  the  order  of  the 
autoregressive-difference  operator  in  the  A.RI.MA  process,  e.g.  p+c)  in  an 
ARIMA i p, d, q .•  process.  Thus,  revision  cf  b+  is  roughly  comparable  t-  the 
production  of  forecasts  for  the  first  (n+d)  lead  times  using  the  Difference 

beyond  (r+d 


ef:  icient  m  genera x 


etter  aorrea; 


a 


On  the  other  hand,  there  is  a  most  important  advantage  enjoyed  by  the 
Integrated  Torn.  It  lies  in  the  area  of  interpretation.  This  is  an  aspee 
forecasting  which  should  not  be  underestimated.  7/e  may  recall  Stem's 
comments  (1974)  that  managers  who  require  forecasts  may  be  prepared  to  acc 
trends  and  seasonal  effects  because  these  correspond  to  familiar  ideas.  T 
view  is  merely  one  cf  nary  such  cris  de  coeur  still  heard  by  forecasters, 
true  value  of  a  forecast  is  not  invested  solely  in  its  accuracy  but  also  i 
its  credibility.  The  former  can  be  assessed  only  after  the  event  forecast 
whereas  the  latter  will  determine  whether  the  forecast  is  used  at  all.  Th 
modelling  of  time-series  as  A?. IMA  processes  is  new  a  common  practice,  than 
to  the  availability  of  a  variety  of  powerful  computer  packages,  Eeverthel 
a  major  hurdle  for  most  users  is  still  the  practical  interpretation  of  the 
forecasts. 

The  reason  that  the  Integrated  form  enjoys  an  advantage  here  is  that 
generates  the  forecast  via  the  Eventual  Forecast  Function  (IFF!.  This  is 
linear  combination  cf  deterministic  functions  of  the  lead  tim.e  T.  For  fixe  : 
*vn r>t ’  — - 3  are  kr.cwn  exactly,  but  their  coefficients  in  the  linear 
pt  with  each  new  observation.  Thus,  in  the  ARIMA ,  1 , 2 ,  * 

F  is  given  by  equation  .'5).  The  deterministic  functions  are 
)=i  ana  :„(T)=T.  Their  coef f icients,  the  components  cf  are  revised 
bservatior.  via  equations  !4).  The  r.ew  observations  allow  the 
rt.  while  rreserving  the  basic  structure  of  the  EFF. 


unistic 
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revision  equations  yield  current  values  for  the  non-seasonal  component  £  = 

(bt,b!'),  and  the  seasonal  component  3  =  (3^,S^, . . . ,S^_ ) .  These  ccmocnents 

O  I  — Z  i  c  x.c. 

have  useful  and  readily  understood  interpretations.  In  the  ncn-seascr.al 

comrcnent,  b  is  the  current  level  of  the  crocess,  and  b“  the  current  gradient 

of  the  linear  trend,  i.e.  the  predicted  rate  of  change  in  level  per  month.  In 

the  seasonal  component,  is  the  additive  seasonal  factor  for  month  (t+k).  It 

i C 

predicts  the  amount  by  which  the  data  will  deviate  from  the  process  level  k 
months  from  now  (time  t). 

Exhibit  1  about  here. 

Exhibit  I  displays  the  data  for  the  years  1968-75,  ana  the  corresponding 
values  cf  the  current  level.  The  values  of  the  gradient  are  not  .displayed,  but 
follow  a  path  similar  in  shape  to  that  of  the  level,  though  with  a  different 
vertical  scale.  In  January  1968,  with  a  data  value  of  95.  we  find  the  current 
level  is  *.1.8  and  the  gradient  0.367.  Thus,  in  this  month,  the  nor. -seasonal 
component  of  the  forecast  for  T  months  ahead  is  given  by  (41.8  +•  0.C67T/. 
Clearly,  as  the  data  evolved,  this  proved  to  be  optimistic,  as  the  trend  during 
the  next  two  years  appears  to  be  downwards.  This  is  reflected  in  the  January 
1970  gradient  0.027,  which  is  smaller,  but  still  positive.  From  equation  (S), 
we  can  see  that  the  gradient  adapts  only  slowly  with  each  new  observation.  In 
January  1971  and  1979,  the  gradient  adapts  to  the  increasing  level,  with  values 
0.179  and  C»29o  respectively.  In  the  latter  month,  the  predicted  linear  trend 
in  the  process  T  months  ahead  is  (59.7  ♦  C.096T).  As  Exhibit  1  clearly  shows, 
this  would  have  been  a  very  poor  long  term  prediction.  However,  by  January 
1974,  the  predicted  trend  is  (41. o  -  C.061T).  This,  in  the  event,  may  be  a 
little  pessimistic.  It  is  important  to  note  that  not  only  have  we  been  able  to 


quote  the  non -seasonal  component  without  reference  to  the  seasonal  one  hut, 
from  equations  (3)  arid  (9),  we  can  calculate  them  separately.  Thus,  for 
example,  we  could  forecast  using  the  Difference  Equation  form  and  monitor  t 
level  and  trend  directly  via  (3). 

Exhibit  2  about  here 

The  values  of  the  seasonal  component  are  revised  with  every  cbservatic 
and  can  be  plotted  in  the  same  way  as  the  non -seasonal  one.  Since  it  invol 
twelve  paths  and  they  are,  in  fact,  very  stable,  they  are  net  reproduced  he 
It  is  just  as  revealing,  in  this  case,  to  plot  the  current  seasonal  co.mpcne 
of  the  data,  i.e.  X+  -  bj.  This  is  displayed  in  Exhibit  2.  There  is  a  ver 


cat' tern* 

.til 

August  are 

the  sar.e  • 

The  ether  months  a 

re 

equal  Ip.- 

exception 

of  December,  which 

•r'irs  tc  ce  on 

downward  trend.  It  is  interesting  to  note  that  when  a  subset  of  these  data 
to  '.lav  1975)  was  analysed  earlier  by  hi  liner  and  Tiao  ;  1 979 :  those  authors 
del  in  which  the  seasonal  component  was  deterministic.  The 
M'ity  of  such  a  model  is  reflected  in  Exhibit  2.  We  would  have  mu' 
der.ee  in  predictions  of  the  seasonal  component,  3 . ,  of  this  for ee 

O*"'  -omqqr'.rc  1  -op 
— «* *  *  o  ~  | 

c  f  n  l 4  ^  V 

urpese  of  this  paper  is  to  make  a  case  for  the  usefulness  of  the 
Form  of  an  AH IMA  forecast.  A  brief  review  cf  its  derivation  is 
A  new  method  for  obtaining  an  essential  mart  of  it  is  riven  in 
his  form  of  the  forecast  has  often  beer,  neglected  ir.  the  literatu. 
sene  useful  i2-»  T he £ 0  -ierive  t'rcrr.  1*3  -uirech  us v  r* 

orecast  F’unctior..  In  particular,  we  have  emphasised  two  points. 


First,  it  need  not  be,  as  is  often  supposed,  computationally  inferior  t 
the  Difference  Equation  approach.  Secondly,  it  offers,  via  the  IFF, 
considerable  advantages  for  interpretation  of  the  forecast.  In  both  cases,  t 
argument  is  illustrated  with  an  example. 
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APPENDIX 


Derivation  of  the  d-coef f icients 


First,  note  that  substituting  equation  (2)  into  the  usual  forecast 
revision  identity  X,  (T)  =  X.  ,  (T+l)  +  and  using  the  revision  equation 

■  Lb  ,+  he  yields  ,  =  d  ,  (j  =  1,2, . . . ,V-7  ;  T  =  1,2,...,M). 

Thus,  it  is  necessary  to  obtain  only  {cL,  n  :  T  =  1,2,...,’.!}  ani  this  nav  be 

i ,  J  ' 

achieved  as  follows. 

Applying  the  usual  conditional  expectation  arguments  to  X  to  derive  T-step 
ahead  forecasts  for  T  =  p+I,  p+2,...,q  ,  yields 


V--  "  A,  ®p+r*iet-i 


Using  equation  (2)  and  the  fact  that  L  af.(k)  =  C,  the  left-hand  side  of  (11) 

.  ,,  K  1 

K-J 

may  be  written  as 


{-X.  '•  p+r->: )  -  :  h.'f .  \p*r-'.<}} 


-  roa x(0,  p+r-M)  .  A  further  application  cf  equation 


n  M-r+k-r- 


which  is  to  be  equated  to  the  right-hand  side  of  (11). 
Equating  coefficients  cf  e  ,  for  i  =  0,1,..., M-r  ,  we  fi: 


i  yields 


A.  ^  ^  i.  —  f  *  .  . 


where  =  rr.axiO,  p*r-).l*i )  ,  for  0  <  i  <  M-r  ,  1  <  r  <  V. 
equations  (13;  it  is  easy  to  derive  the  matrix  equation  . 


on  these 


■ a i 
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